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Abstract. Motivated by issues arising in computer science, we inves- 
tigate the loop-free paths from the identity transformation and corre- 
sponding straight words in the Cayley graph of a finite transformation 
semigroup with a fixed generator set. Of special interest are words that 
permute a given subset of the state set. Certain such words, called min- 
imal permutators, are shown to comprise a code, and the straight ones 
comprise a finite code. Thus, words that permute a given subset are 
uniquely factorizable as products of the subset's minimal permutators, 
and these can be further reduced to straight minimal permutators. This 
leads to insight into structure of local pools of reversibility in transfor- 
mation semigroups in terms of the set of words permuting a given subset. 
These findings can be exploited in practical calculations for hierarchical 
decompositions of finite automata. As an example we consider groups 
arising in biological systems. 

1 Introduction 

From the computational perspective it is very important to know how a par- 
ticular element of a transformation semigroup can (efficiently) be generated. Of 
special interest are elements of the semigroup that permute a subset of the state 
set, as the hierarchical decomposition of the semigroup [8] depends on the group 
components [1,2]. Here we study the ways in which a particular transformation 
can be expressed without any redundancy. These generator words head towards 
the target transformation without any repetitions, so they are called straight. 
Straight words also encode the information describing all possible ways that 
particular semigroup element can be generated. 



1.1 Notation 

For a finite transformation semigroup (X, S) we fix a generator set of trans- 
formations T = {ti, . . . , t n }, so S = (T). We also consider the generators as 
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symbols, thus a finite product of the generator elements becomes a word in T + 
(the free semigroup on generators T whose associative binary operation is con- 
catenation). It is then convenient to consider the empty word e as the identity 
map. We need to distinguish between the word (often thought of as a sequence 
of input symbols) and the transformation it realizes: for the word we just write 
the generator symbols in sequence ... t in G T + while the transformation is 
denoted by . . . ti k G S, where the arrow indicates the order in which the 
generator elements are multiplied and emphasizes that is a mapping. 

For transformations, we either use the usual 2-line notation for mappings, or 
if it would become too space consuming we apply the linear notation suggested 
in [5]. This is a natural extension of the cyclic notation of permutations. Con- 
sidering the mappings as digraphs, each transformation consists of one or more 
components. Each component contains a cycle (possibly a trivial cycle). Unlike 
the permutation case, the points in the cycle can have incoming edges, denoted 

by 

[ sourcei , . . . , source m ; target ] 

where target is the point in the cycle. If a source point also has incoming edges 
from other points the same square bracket structure is applied again recursively. 
We can say that the points in the cycle are sinks of trees. Therefore the brackets 
indicate the existence of a nontrivial permutation of the sink elements of the 
trees, but not of their sources: 

([ sourcesi; targe^ ],•■-,[ sources/*,-; target fc ]). 

This corresponds to the cycle ( target^..., target^) but at the same time it 
contains information on transient states. The order is arbitrary if there are more 
than one component. (See below for examples.) 1 

2 Straight Words 

If the goal is to generate a transformation s G S as quickly as possible without 
any digression, then in each step of the generation a new transformation should 
appear. Also, if a prefix generates the identity map, so strictly speaking we did 
nothing so far, then the prefix can be discarded. More precisely, 

Definition 1 (Straight Words). Let s G S be a transformation generated by 
the word . . . U m G T + , so s — . . . t% m , then this word is straight if 

Ui ■ ■ ■ Ul ^ e , k G {1, . . . , m - 1} (1) 

and 

t~77. til = *ii =*> k = 1 (l<k,l<m). (2) 

1 Our notation is slightly different from [5] as we do not use square brackets for a 
singleton source. 



Example 1 ( Cyclic (monogenic) semigroup) . Let X = {1, 2, 3, 4} and t = (2412)' 
or, in the alternative notation, t = ([[3; 1];2],4). The semigroup generated by t 
is 

(t) = {£ = (| 4 1 2)1^ = (4224)1' = (2442)}' 

t, t 2 and t 3 arc straight words, but these are the only ones. Higher powers, like 
t A = t 2 already contain repeated transformations. This example shows that being 
straight is not necessarily connected to the formal notion of containing repeated 
subwords. 

Example 2 (Cyclic group). Let g = (1,2,3) be a permutation, then g 3 = e is a 
straight word producing the identity map. This example justifies condition 1 in 
Definition 1, as we allow the identity transformation at the end of a word, but 
not inside. 

2.1 Alternative defintion using trajectories 

Definition 2 (Trajectory). Let si,...,s n be a sequence of semigroup ele- 
ments, Sj G S . Then the sequence is a trajectory if for all Sj, 1 < j < n there is 
a generator ti G T such that Sj ■ ti = Sj+i . 

A trajectory is a path in the Cayley graph of the semigroup starting at the trivial 
transformation. We can associate a trajectory with a word. 

Definition 3 (Trajectory of a word). Given a wordt^ . . . ti m the correspond- 
ing trajectory is calculated by taking the products of prefixes: 

r> rr^ i- — 

With trajectories we can obviously give an alternative definition of straight 
words. 

Definition 4 (Straight words). A word is straight if all the elements of its 
trajectory are distinct, except the case of loops when the first and the last element 
coincide (and equal e). 

2.2 Straight words and transformations 

From finiteness it follows that the straight words cannot be extended beyond 
some finite length, since there are finitely many elements of the semigroup and 
each prefix should realize a distinct semigroup element. An obvious bound on 
the length of the straight words is |5|. This bound is reached in Example 2. We 
also observe that all semigroup elements can be realized by a straight word. 

Lemma 1. Let (X, S) be a transformation semigroup with states X and semi- 
group S generated by T . Each semigroup element s G S can be realized by a 
straight word in the letters of T . 



Proof. Let s = . . . tj TO . If . . . i, m is not straight then there is k ^ I such that 
ti x . . . ti k =ti x ... U t . Suppose that k < I. Then the product U x . . . ti k ti l+1 . . . U m 
still generates s, after we cut out U k+1 . . . U t . 

Similarly, in case an identity appears at some position (not the final one) in a 
trajectory then the whole prefix can be ignored up to that point. If the reduced 
word is not straight then we can repeat either processes. Due to finiteness this 
method will stop, and thus produce a straight word generating s. □ 

Another way to see that there is at least one straight word for each transfor- 
mation is to observe that the first occurences (but not the subsequent ones) of 
transformations in a breadth-first generation of S by T are produced by straight 
words. 

Corollary 1. Any minimal length word generating s 6 S is a straight word. 

We have seen that for each semigroup element we can give at least one straight 
generator word. The following example shows that there can be more straight 
words for a mapping. 

Example 3 (Constant Maps). Let ti = ( \ \ ) and t-z = (22) be two generators, 
then t\ and £2^1 are each straight words for t\ , while ti and t\ti are straight 
and both realize ti . Constant maps render the transformations before them 
negligible. 

2.3 Synonym Straight Words 

Different straight words may represent the same transformation. For example, if 
we add a second generator, ^=(4224)^° Example 1, then clearly r and t 2 are 
words with this properties. Moreover, two different words may have the same 
trajectories. 

2.4 Generalization: Straight Paths 

We can study straight words in a more general settings, we look for straight 
words w = . . . ti m such that s • w = r, where r,sGS. Actually these arise as 
labels of 'straight paths' in the Cayley graph of the semigroup between nodes s 
and r, i.e. simple paths that do not cross themselves but go directly from s to 
r. We get the special case of straight words when s = e. 

2.5 Computational Implementation 

Computational enumeration of straight words can easily be done with a back- 
track algorithm. Wc implemented the search algorithm in the SgpDec software 
package [4] in the GAP computer algebra system [6]. 



3 Minimal Straight Words and Permutations of Subsets 



From now on we focus on straight words that induce permutations on a subset 
of the state set. The full permutator semigroup Perm(Y) for a subset Y C X in 
(X, S) is 

Perm{Y) = {s G S : Y ■ s = Y}. 

Elements of Perm(Y) are called also permutators ofY. Perm(Y) is closed under 
products, so by finiteness it restricts to a group of permutations acting on Y. 
The restrictions of elements of Perm(Y) to Y thus comprise a permutation 
group or 'pool of reversibility' or 'natural subsystem' within the transformation 
semigroup (X, S). However, while any s £ Perm(Y) is also defined on all of X it is 
not generally a permutation of X. The elements of PermiY) may agree on Y but 
disagree on X\Y, so PermiY) may not itself be a group nor act faithfully on Y . 
We also call a word a permutator word if it realizes a permutator transformation. 

Example 4 (Cyclic uniquelly labelled digraph as an automaton). The genera- 
tor set consists of 3 elementary collapsings, T = {a = 1 h-> 2, 6 = 2^3, 
c = 3 h> 1}. The generated semigroup has 21 elements and, in the notation 
introduced above, the straight words of the semigroup elements are: 



[3;i] 


c 


[2:3] 


b 


[1:2] 


a 


[[2;3];1] 


cb 


[[i;2];3] 


ba 


[[3;i]; 2] 


ac 


([i;2],3) 


cba 


([3;i],2) 


bac 


(U2;3]) 


acb 


(1,[3;2]) 


cbac 


([2;i],3) 


bacb 


Qi;3],2) 


acba 


[[2;i];3] 


cbacb 


[[i;3];2] 


bacba 


[[3;2];1] 


acbac 


[i;3] 


cbacba 


[3:2] 


bacbac 


[2:1] 


acbacb 



plus the constant maps that are represented by a lot more straight words: 

[1,3; 2] abca, aca, acbabca, acbaca, acbacbca, acbca, babca, baca, bacbabca, 

bacbaca, baebca, bca, ca, cbabca, cbaca, cbacbabca, cbacbca, cbca 

[2,3;1] abc, acabc, acbabc, acbacabc, acbaebe, aebe, babe, bacabc, bacbabc, 

bacbacabc, baebe, be, cabc, cbabc, cbacabc, cbacbabc, cbaebe, cbc 

[1, 2; 3] ab, acab, acbab, acbacab, acbaebcab, aebcab, bab, bacab, bacbab, bacbacab, 

baebcab, bcab, cab, cbab, cbacab, cbacbab, cbaebcab, cbcab 



Now let Y = {1,2}. Then there are exactly 4 straight words permuting Y, 
bac and cbac realizing the transposition (1 2), and c and bacbac realizing the 



identity, thus { c , bac, cbac, bacbac} C Perm(Y). Note that two of these words 
are products of two of the others. If we extend the search also for words that are 
not straight we can find more permutators. For example baac = bbac = ([3; 1], 2). 

A word it; is a minimal permutator of Y if w represents an element of Perm(K) 
and w is not a product of two or more words permuting Y . That is, w ^ w\W2 
for any words w\ and wi representing elements of Pcrm(Y). The set of minimal 
permutators is not necessarily finite, as we can use idempotcnts to "pump in the 
middle" like ba n c in Example 4. Therefore we turn our attention to the set of 
minimal straight permutators denoted by Ms(Y). 

Fact 1 The set Ms(Y) of minimal straight permutator words for Y is finite. 

Proof. The assertion easily follows from the fact that straight words are bounded 
in length. □ 

Now we need to show that we do not lose anything by discarding the words that 
are not straight, i.e. we can still generate the full permutator semigroup. We will 
use the following obvious fact. 

Fact 2 If w — uv permutes Y and u permutes Y , then v permutes Y. 

Theorem 1. In the free semigroup T + on the generators of S , the minimal per- 
mutators M(Y) of Y generate the subsemigroup of all words realizing elements 
o/Perm(Y). That is, 



Moreover, the minimal straight permutators Ms(Y) of Y generate a subsemi- 
group of words realizing all elements of Perm(Y). 

Proof. Let p = t\ . . . t& represent an element of Perm(Y'). We show p is a product 
of minimal permutators by induction on k. Either p is a minimal permutator or 
there is a least j strictly less than k so that t\ . . . tj permutes Y. Now t\ . . . tj 
is a minimal permutator of Y and p = (t\ . . .tj)(tj+i . . . ifc) with each of the 
expressions in parentheses permuting Y. The length of the second word is strictly 
less than k, so by induction hypothesis, it too can be written as a product of 
minimal permutators of Y. This proves that an arbitrary word p representing 
an element of Perm(K) can be factored as a product of minimal permutators 
of Y. Each minimal permutator factor can be shortened by removing letters if 
necessary to a straight word (or the empty word). The result follows. □ 




(M(Y)) = all words representing elements of Perm(Y). 



Theorem 2. Any word w representing a permutator ofY can be factored uniquely 
into a product of minimal permutators ofY. 



Proof. By the previous theorem, we can write 



w = wi ■ ■ ■ Wk, 

where each un is a minimal permutator word of Y. Suppose w can also be written 

as 

w = w[ . . . w' e , 

where again each w\ represents is a minimal permutator word for Y. We show 
£ = k and Wj = w'j for all j (1 < j <£).lf this were not the case, then let i be the 
least index such that Wi ^ w\. Without loss of generality, assume |tOi| < |u>-|. It 
follows then that w\ = WiV for some nonempty word v. By Fact 2, v represents a 
permutator of Y. But we have then written w[ as a product of permutator words, 
this contradicts the choice of w[ as a minimal permutator. It follows W{ = w[ for 
all i, and, since the two factorizations are of the same word, that £ = k. □ 

In other words, 

Corollary 2. The minimal permutator words are a code. 

Corollary 3. The minimal permutator straight words are a finite code. 

The last corollary shows the usefulness of straight words, when looking for per- 
mutators instead of an infinite search space we can restrict the search to a finite 
set of words. 

Fact 3 For minimal permutator word w, there is a (in general non-unique) 
straight minimal permutator word red(w) obtained from w by removing some 

letters such that wt = red(w] . 

Proof. Considering the Cayley graph of the transformation semigroup (X, S) 
with generators T. This has vertices S 1 =SU{e}, where e denotes the identity 
mapping on X, and edges s —> s' , where s' = s t with t £ T, s, s' € S 1 . Now, 
by the alternative definition of straight words, it is clear the a word is straight 
if and only if the path it labels starting at e and has no loop (does not visit 
any node more than once). Noting that adding or removing loops to the path 
corresponding to a product does not change its endpoint, we conclude that re- 
moving contiguous subwords from the word w corresponding to loops, iteratively 
if necessary, results in a path with no loops, corresponding to a straight word w' 
representing the same transformation as w. □ 

Theorem 3. There is a well-defined homomorphism (j) '■ M + {Y) -» Ms(Y) + 
from the semigroup of permutator words onto the semigroup generated by mini- 
mal straight permutator words. Furthermore, <j) is a retraction, i.e. <j){w) = w for 
all words w in Ms{Y) (and hence is the identity on Ms{Y) + ). The permutator 
words w and 4>{w) act by the same permutation ofY, and moreover by the same 
mapping on X , and <f)(w) is a straight word obtained from w having the same 
trajectory as w except for the removal of loops. 



Proof. To get a well-defined homomorphism from minimal permutator code to 
the straightword minimal permutator code, one only needs to choose some reduc- 
tion for each minimal permutator (any reduction at all would work). The reason 
why one gets a homomorphism is due to that fact the minimal permutators are 
a code, hence free generators of a free semigroup, so we need only say where 
each generator goes and extend uniquely by freeness. □ 

The reduction of a minimal permutator to a straight word need not be unique. 
This comes from the fact that synonym straight words do exist. Thus the homo- 
morphism of the theorem need not be unique. 

One natural way to choose the reduction red(w) is the following: given a 
minimal permutator word w that is not straight, find the first node (along its 
trajectory) that is later repeated. Start deleting letters after the letter that first 
takes us into this node. Find the last time this node occurs. Delete all letters 
from there up to and including the one taking us into the node for the last time. 
This process removes at least one letter since the word was not straight. Repeat 
the procedure until the resulting word is straight. This necessarily terminates 
with a reduced form for w, realizing the same transformation by a straight 
word obtained from uu by some excising some subwords ('removing loops' in the 
trajectory as described). 



4 A Biological Example 



It seems that in constructing interesting examples the human mind is somewhat 
contrained and reverts back to special cases. Therefore studying "naturally oc- 
curing" transformation semigroups can be useful, so here we investigate a biolog- 
ical example. We should also mention that in exchange semigroup and automata 
theory can also provide useful tools for other sciences [9]. 



4.1 The p53-mdm2 regulatory pathway 

Biological networks are frequently modelled by Petri nets and thus it is not 
difficult to convert such a model to a transformation semigroup [3]. Figure 1 
shows such a model of the p53-mdm2 regulatory pathway, which is important 
in the cellular response to ionizing radiation and can trigger self-repair or, in 
extreme cases, the onset of programmed cell-death (apoptosis). This pathway 
is involved in ameliorating DNA damage and preventing cancer [7]. Figure 2 
shows the corresponding finite automata with 16 states in which two levels of 
each of the 4 molecular species involved are distinguished. Corresponding to the 



tl 




t4 



Fig. 1. Petri net for the p53-mdm2 regulatory pathway. P = p53, M = 
mdm2, C = p53-mdm2, R = p53*. 

transitions we have the following generator transformations: 

h = [1; 2] [3; 4] [5; 6] [7; 9] [8; 10] [11; 12] [13; 14] [15; 16] 

t 2 = [2; 1] [4; 3] [6; 5] [9; 7] [10; 8] [12; 11] [14; 13] [16; 15] 

t 3 = [1; 3] [2; 4] [5; 7] [6; 9] [8; 11] [10; 12] [13; 15] [14; 16] 

U = [3; 1] [4; 2] [7; 5] [9; 6] [11; 8] [12; 10] [15; 13] [16; 14] 

h = [4, 12; 8] [9, 16; 13] 

t e = [2, 6; 5] [4, 9; 7] [10, 14; 13] [12, 16; 15] 

t 7 = [5, 6; 2] [7, 9; 4] [13, 14; 10] [15, 16; 12] 

t 8 = [8, 11; 3] [10, 12; 4] [13, 15; 7] [14, 16; 9] 

to = [5; 1] [6; 2] [7; 3] [9; 4] [13; 8] [14; 10] [15; 11] [16; 12] 

4.2 Analysis of a permutator subsemigroup 

None of the above generators contain a cycle, so the existence of a nontrivial 
permutation group cannot be simply read off. The generated semigroup has 
316665 elements. The decomposition of the semigroup shows has (several copies 
of) the following group components: cyclic group C2 acting on 4, symmetric 
group 53 acting on 3 and Ci acting on 2 states. 



Fig. 2. Automaton derived from 2-level Petri net of the p53 system (16 
states). The labels on the nodes encode the possible configurations for 
M, C, P and R (in this order). denotes the absence (or presence be- 
low a threshold), 1 the presence (above the threshold) of the given type of 
molecule. For instance, 0101 means that C and R are present. The shaded 
states correspond to the state set {3 = 1000, 5 = 0001, 8 = 0100}. 

Wc pick the set {3, 5, 8} (there are many 3-element subsets that are mutually 
reachable from each other under the action of the semigroup, therefore they have 
isomorphic permutator groups). Computer calculation shows |Perm({3, 5, 8})| = 
542. Consider the following words of length 13 and 15, found by a breadth-first 
search, 

realizing transformations 

~ct =([1, 2, 10; 3], [4, 7, 9, 11, 12, 15, 16; 5], [6, 13, 14; 8]) 
~t =[1, 2,4; 3] ([10, 11, 12, 13, 14, 15, 16; 5], [6, 7,9; 8]). 

As highlighted, these are clearly permutator words for the set {3, 5, 8} and gener- 
ate S3. It is straightforward to verify that these two words are straight. Moreover, 



a and b can be checked to be minimal permutators (i.e. they cannot be properly 
factored into permutators of {3, 5, 8}). However, the idempotent powers of these 
words bb* and aaa are not equal, so the transformations do not lie in the same 
subgroup of the semigroup of the automaton. 

We derive from these words, x = bbabb (a word with 73 letters), which reduces 
to straight word: bba, (with only 43 letters), giving the transformation 

it = ([1, 2, 10; 5], [6, 13, 14; 8])[4, 7, 9, 11, 12, 15, 16; 3] 

and y = aaabaaa, (another long word with 93 letters), which reduces to straight 
word aaab (with 54 letters), giving the transformation 

f = ([1, 2, 10; 5], [6, 13, 14; 8])[4, 7, 9, 11, 12, 15, 16; 3]. 

These words (a, b, aaab, bba) are all straight permutator words, but obviously 
aaab and bba are not minimal permutators since they are products of (straight) 
minimal permutators a and b. 

We have two copies of group the symmetric group S3 each faithfully acting on 
{3,5,8} = {M, R, C}: one S3 is generated by a and y, and another isomorphic 
copy of 53 by b and x with idempotents (the identity elements of these two 

groups): a? = y^ = [1, 2, 10; 8] [4, 7, 9, 11, 12, 15, 16; 3] [6, 13, 14; 5] and ~$ = a? = 
[1, 2, 4; 3] [6, 7, 9; 5] [10, 11, 12, 13, 14, 15, 16; 8], respectively 

Together the elements ~ct and b generate a 12 clement semigroup which is 
just the union of these two groups. This is to some extent counterintuitive, as one 
would expect one copy of the permutator group for one particular subset of the 
state set; furthermore as mentioned above the permutator semigroup PermiY) 
has, not just these 12, but 542 elements, and this is but one of many instances 
of sets of states in this biological model acted on by the symmetric group S3. 

5 Conclusion 

Based on algorithmic efficiency considerations we studied straight words that 
encode loop-free paths in the Cayley graph of a transformation semigroup. We 
focused on straight words generating transformations that permute a given sub- 
set of the state set. We found that these minimal permutator straight words form 
a finite code, and also the minimal permutator words form a code, although, as 
easy examples show, the latter is generally an infinite code. The minimal permu- 
tator straight words generate the corresponding subgroup of the transformation 
semigroup. These can be exploited in the calculations of hierarchical decompo- 
sitions. These findings show that there are lot more ways within a semigroup 
to generate a subgroup than one might think, but for finding the subgroup it is 
enough to consider a subset of them. 
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